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Automatic  Verification  of  Hypothesised  Phonemic 
Strings  in  Continuous  Speech 


ABSTRACT 


A  procedure  is  described  for  mapping  a  phonemic  string  onto  a 
computer  .^representation  of  an  interval  of  continuous  speech. 

The  procedure  has  as  its  inputs  a  hypothesized  phonemic  string 
and  a  matrix  of  labels  and  label  scores  obtained  by  computer 
analysis  of  a  complete  utterance.  These  two  inputs  are  mapped 
by  an  iterative  technique  which  ())  .'earches  for  individual 
phonemes  in  an  appropriate  older,  not  necessarily  left-to- 
right;  (2)  continuously  adjusts  ror  phoneme  duration  varia¬ 
tions;  and  (3)  can  deal  with  intraword  coarticulation  effects. 
The  mapping  procedure  yields  estimated  time  boundaries  and  a 
goodnes3-of -match  score  for  the  hypothesized  phoneme  string. 

A  computer  program  has  been  written  that  uses  this  algorithm 
to  perform  automatic  recognition  of  words  and  phrases  within  an 
operating  continuous  speech  understanding  system.  Examples 
and  current  results  from  this  system  are  presented. 
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1.  INTRODUCTION 

Phor.emi'  -acoustic  mapping  is  the  process  of  matching  a  phonemic  string  to  a 
computer  analysis  of  a  portion  of  a  speech  signal.  This  document  describes 
the  algorithms  used  in  the  mapping  program  of  System  Development  Corporation's 
"Mini-System,"  a  continuous-speech  understanding  system  running  entirely  on  a 
Raytheon  704  minicomputer.  The  Mini-System,  which  has  been  operational  since 
May,  1973,  was  designed  as  test  platform  for  acoustic-phonetic  routines  now 
incorporated  into  an  extended  rystem  that  runs  on  an  IBM  370/145. 


2.  OVERALL  PROCESSING  PATTERN  OF  THE  MINI-SYSTEM 

2.]  THE  TOP-END 

The  Mini-System  has  a  minimal  linguistic  top-end  and  powerful  low-level 
acoustic  analysis.  The  top-end  is  a  simple  left-to-right  predictive  parser 
with  full  backtracking.  It  begins  by  predicting  all  words  that  can  begin 
an  utterance,  with  the  predicted  left  bourdary  for  the  list  set  to  the  begin¬ 
ning  of  the  utterance.  The  rest  of  the  system  then  returns  to  the  top-end  a 
subset  of  this  list  (possibly  null) ,  representing  words  judged  to  be  possible 
acoustic  matches,  along  with  a  goodness-of-match  score  and  an  estimated  ending 
time  boundary  for  each  accepted  word.  The  t»_p-end  then  predicts  the  list  of 
words  possible  syntactically  after  the  best-matching  first  word,  and  so  on. 

If  at  any  point  in  this  process  all  words  in  the  predicted  list  fail  to  match, 
the  top-end  backtracks  to  a  second-choice  word  found  earlier. 

2.2  THE  A- MATRIX 

The  Mini-System  operates  as  a  two-stage  process.  The  utterance  is  first 
analyzed  by  a  set  of  acoustic-phonetic  routines  that  produce  a  structured 
summary  called  the  A-oatrix.  The  second  stage  consists  of  the  top-end,  lexicon 
lookup,  and  mapping  programs,  which  operate  in  a  loop  until  the  utterance  is 
successfully  parsed  or  all  parsings  fail.  The  Mini-System  flow  is  illustrated 
in  Figure  1. 
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The  A-matrix  is  a  two-dimensional  array  that  contains  the  results  of  acoustic- 
phonetic  analysis.  The  speech  signal  is  divided  into  10-msec,  segments.  For 
each  segment,  a  set  of  labels  and  probabilities  is  created.  The  labels  may  be 
the  names  of  specific  phonemes,  groups  of  phonemes,  or  spectral  classes.  Tn 
additi_  pitch,  energy,  formant,  and  other  d.ita  are  recorded  for  each  segment. 
(See  Figure  2.) 

2.3  THE  MAPPING  PROGRAM 

The  mapping  program  receives  from  the  lexicon  lookup  program  a  list  of  words, 
orthographi.cally  spelled,  with  one  or  more  phonemic  spellingr  for  each  in  a 
standard  ARP ABET  notation  (see  Appendix  2.),  and  a  beginning  segment  number. 

The  mapping  program  then  returns  a  subset,  possibly  null,  of  this  list  that 
represents  words  that  are  potentially  present  at  the  given  point  in  the  utter¬ 
ance.  In  addition,  for  those  words  judged  possible,  a  word  score,  ranging 
from  0  to  99,  is  returned  that  represents  the  relative  certainty  of  occurrence. 
The  ending  segment  number  for  the  word  as  estimated  by  the  mapping  program  is 
also  returned. 

A  problem  arises  when  multiple  phonemic  variants  of  the  same  word  are  all 
judged  to  be  possible  occurrences.  If  all  of  the  variants  end  at  approxi¬ 
mately  the  same  place,  the  best  is  chosen  and  the  others  are  ignored.  If  they 
end  at  different  places,  each  rauvt  be  kept  as  a  separate  possibility,  because 
the  top-end  will  have  to  consider  each  as  a  separate  parse  branch. 

3.  PHONEMIC- ACOUSTIC  MAPPING 

The  following  sections  describe,  in  detail,  the  precise  method  used  by  the 
Mini-System  mapping  program  to  match  phonemic  strings  against  the  A-matrix. 

The  overall  flow  uf  the  mapping  process,  illustrated  in  Figure  3,  is  to  first 
deal  with  the  word-juncture  p^oblen,  thus  obtaining  an  assumed  beginning  seg¬ 
ment  number  for  the  word,  ther  match  each  word  in  the  predicted  list  to  the 
A-matrix,  deriving  a  goodness-of-raatch  score  and  ending  time  boundary  for  each, 
and  finally  to  subset  the  list  of  words  on  the  basis  of  the  match  scores. 
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Figure  2.  Sample  Portion  of  an  A-Matrix  for  the  Beginning  of  the  Word  "EXPLAIN 


Figure  3.  The  Mapping  Process 
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3.1  FINDING  THE  ASSUMED  BEGINNING  OF  THE  WORD 

The  mapping  program  receives  from  the  top-end  a  segment  number  for  the  begin¬ 
ning  of  the  word.  This  number  represents  the  next  segment  after  the  end  of 
the  previous  word  found,  and  must  be  adjusted  to  take  into  account  inter¬ 
word  pauses,  trailing  aspirations,  and  other  effects  of  word  juncture.  The 
algorithm  used  is  based  solely  on  rough  segment  labels,  which  are  approximate 
spectral  labels  assigned  every  segment  base!  on  amplitude  and  zero-crossing 
data.  Five  rough  segment  labels  are  possible:  SI  (silence) ,  UV  (low-amplitude 
voiced  or  unvoiced) ,  VW  (vowel-like) ,  SS  (strong  frication)  and  VC  (all  other — 
usually  weak  voicing) .  The  following  algorithm  is  used  to  adjust  the  assumed 
beginning. 

Bl.  Ignore  Trailing  Aspiration 

If  the  next  four  segments  are  SI,  UV,  or  VC  segments,  and  are  followed 
by  at  least  six  SI  or  UV  segments,  advance  the  beginning  seven  segments. 

B2.  Skip  Over  Long  Pauses 

Advance  t  e  beginning  over  any  SI  segments,  ignoring  isolated  UV,  SS,  or 
VC  segments. 

B3.  Ignore  Tongue  Clicks 

If  the  next  five  segments  are  followed  by  at  least  eleven  consecutive 
SI  segments,  advance  the  beginning  fourteen  segments  and  go  t  ,tep  B2. 

B4 .  Back  Off 

Back  off  two  segments  unless  that  would  retreat  the  assumed  begmnina  to 
before  the  beginning  given  by  the  top-end.  This  is  to  ensure  that  plosive 
bursts  and  such  have  not  been  inadvertently  skipped  over. 
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3.2  THE  MAPPING  TABLE 

A  special  table  is  used  throughout  the  mapping  process  to  keep  track  of  inter¬ 
mediate  results.  This  is  a  sample  mapping  table  for  the  word  "PRINT": 


PHX 

STX 

CLX 

SCX 

FNX 

BX 

EX 

P 

0 

PL 

20 

1 

10 

14 

R 

0 

SO 

0 

0 

15 

17 

IH 

n 

L 

VW 

34 

1 

18 

30 

N 

0 

NA 

0 

0 

31 

38 

T 

0 

PL 

0 

0 

39 

46 

The  columns  have  the  following  meanings: 

PHX  The  name  of  the  phoneme  in  standard  two-character  ARPABET. 

STX  The  predicted  stress.  Zero  means  reduced,  one  means  unstressed,  and  two 
means  stressed.  Stress  is  meaningful  only  for  vowels. 

CLX  The  phoneme  class.  The  following  classes  are  used: 

VW:  IY,  IH,  EY,  EH,  AE,  AA,  AH,  AO,  OW,  UH ,  UW,  AX,  IX,  ER, 

AW,  AY,  OY,  EL,  EM,  EN 
NA:  K,  N,  NX 

SO:  Y,  W,  R,  L 

SS:  S,  SH,  Z 

PL:  P,  T,  K,  B,  D,  G 

XX:  All  others 

SCX  The  score,  or  probability  of  occurrence,  of  the  phoneme  as  judged  by 
the  phoneme  locators,  has  a  range  of  0  to  99.  SCX  is  meaningful  only 
if  FNX-1. 

FNX  If  FNX  is  zero,  the  phoneme  locators  have  not  yet  beer,  called  for  this 
phoneme.  If  FNX  is  one,  SCX  contains  the  phoneme  score  determined  by 
the  phoneme  locators. 

BX  These  represent  the  beginning  and  ending  10-msec  segment  numbers  for 

EX  each  phoneme.  If  FNX=1  and  SCX>0,  BX  and  EX  are  as  found  by  the  phoneme 

locators.  Otherwise,  they  are  estimated  positions  set  by  the  mapping 
progrim. 
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Havinq  located  the  assumed  beginning  of  the  word,  an  initi  is  created 

from  the  ARPABET  spelling  received  from  the  lexicon  lookup  program.  The  PHX 
column  is  \  ith  phoneme  names,  the  STX  column  is  filled  with  stress  data, 

nd  the  CLX  column  is  filled  with  the  appropriate  phoneme  classes.  SCX  and  FNX 
are  set  to  zerr.  The  beginning  and  ending  segments  for  each  phoneme,  BX  and 

EX,  are  fillsd,  starting  from  the  assumed  beginning  and  allotting  8  segments 
(80  msec.)  to  each  phoneme.  The  following  is  a  sample  initial  mappin'  .-.able 
for  the  word  "PRINT,"  using  an  assumed  beginning  segment  number  of  10: 


PHX 

SIX 

CLX 

SCX 

FNX 

BX 

EX 

P 

0 

PL 

0 

0 

10 

17 

R 

0 

SO 

0 

0 

18 

25 

IH 

2 

vv 

0 

0 

26 

33 

N 

0 

NA 

0 

0 

34 

41 

A. 

n 

PL 

0 

0 

42 

44 

3.3  THE  PHONEME  LOCATORS 

The  mapping  process  uses  a  set  of  subroutines  that  locate  a  giver.  ph<-..eme  near 
a  given  estimated  position  in  the  speech  utterance  and  return  a  p^oba- 
bility-of- occurrence  estimate  for  that  phoneme  and  location;  these  subroutines 
employ  the  acoustic  results  stored  in  the  A-matrix.  The  phoneme  locators 
perform  their  search  in  a  two-step  fashion:  first,  they  look  for  the  phoneme 
strictly  within  or  overlapping  the  interval  given  then  by  the  mapper.  If  this 
fails,  they  then  expand  their  search  a  certain  "fuzz"  distance,  which  is 
currently  set  to  three  segments.  At  Ire  present  time,  two  phoneme  locators 
are  used,  one  for  vowels  and  sonorants  and  one  for  consonants. 

The  vc-rel  and  sonorant  loc  :or  looks  for  th*»  vowels  and  the  phonemes  Y,  W,  R, 

L,  M,  N,  and  NX.  When  look  .ng  for  a  diphthv.  ;,  the  lc.ator  looks  for  an 
apprf  riate  p-.ttem  of  change  in  the  A-matrix  the  other  phonemes,  the 

decision  is  based  on  the  choices  recorded  in  the  « -matrix  and  on  considerations 
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of  acoustic  similarity.  This  locator  is  also  capable  of  considering  a  certain 
amount  of  coarticulation  effects  by  using  information  on  neighboring  phonemes 
as  recorded  in  the  mapping  table.  If  an  interval  is  given  that  is  voiced  but 
significantly  unlike  the  requested  phoneme,  a  score  of  one  is  returned. 

The  consonant  locator  looks  for  all  the  phonemes  not  covered  by  the  vowel  and 
sonorant  locator.  The  high-energy  fricatives  S,  SH,  Z,  CH,  JH,  and  ZH  are 
located  in  a  straightforward  manner.  The  stop  consonants  P,  T,  B,  K,  G,  and 
D  are  identified  in  the  A-matrix  onlj  at  the  plosive  burst;  thus,  the  locator 
must  allocate  a  certain  portion  of  a  ly  preceeding  silence  to  them.  A  low- 
energy  area  not  identified  as  a  particular  low-en  ;rgy  phoneme  will  get  a 
nominal  score  when  a  low-energy  phoneme  is  requesiei’. 


1.A  ORDERING  OF  THE  P.iONEME  SEARCH 

A  "goodness  score"  is  associated  with  each  phoneme  ir  the  mappi  table.  This 
number  is  ust  1  to  decide  which  phoneme  to  look  for  next.  Each  goodness  score 
is  proportional  to  distance  from  the  nearest  fitted  boundary,  in  phonemes,  and 
the  difficulty  of  locating  the  phoneme  by  machine  in  the  acoustic  signal.  The 
function  below  is  currently  used  to  create  these  scores.  Naturally,  as  our 
ability  to  recognize  certain  phonemes  improves,  this  function  is  changed  to 
reflect  that  fact. 


Goodness  =  (Distance  +  1) 


(Distance  +1)*2 

(Distance  +1)*3 

(Distance  +1)*5 


If  the  phoneme  is  a  stressed  vowel,  or  a 
member  of  the  SS  class,  or  an  N,  or  an  R, 
or  an  ER. 

,"f  the  phoneme  is  an  unstressed  or  reduced 
vowel,  or  a  Y,  or  a  W,  or  an  M,  or  an  NX. 

If  the  phoneme  is  a  member  of  the  PL  class, 

or  a  CH,  or  a  JH,  or  an  L. 

otherwise. 


I 
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These  scores  are  used  fo  order  the  phoneme  search.  The  phoneme  with  the  lowest 
score  is  looked  for  first,  then  the  one  with  the  second-lowest  score  is  looked 
for,  and  so  on.  In  this  way  wo  avoid  the  problems  of  a  strictly  left-to-right 
search,  which  may  stumble  over  a  difficult  phoneme,  while  on  the  other  hand 
not  plunging  so  far  into  the  word,  locking  for  an  easy  phoneme,  that  wa  give 
up  a  decent  estimate  of  the  phoneme's  location. 


3.5  THE  MAPPING  ALGORITHM 

After  the  assumed  beginning  is  calculated  and  an  initial  mapping  table  is 
created,  the  actual  phonemic-acoustic  mapping  process  is  performed. 

First,  an  anchor  phoneme  is  selected,  as  follows.  The  goodness  score  for  each 
phoneme  is  calcuTcted,  and  the  two  lowest-scoring  phonemes  are  given  to  the 
phoneme  locators,  along  with  the  beginning  and  ending  estimated  segment 
boundaries  recorded  in  the  mapping  table.  The  phoneme  returning  the  higher 
score  is  selected  as  the  anchor. 

The  anchor  phoneme  is  entered  into  the  mapping  table  as  found.  The  estimated 
phoneme  locations  of  all  the  phonemes  in  the  table  are  adjusted  to  reflect 
the  new  information  on  the  boundaries  of  the  anchor  phoneme.  Mapping  now  pro¬ 
ceeds  in  an  analogous  manner,  with  the  unlooked-for  phoneme  having  the  best 
gocdness  score  being  looked  for  next.  After  each  phoneme  is  looked  for,  the 
table  is  adjusted  co  reflect  new  boundary  data,  and  the  goodness  scores  are 
recomputed.  This  process  continues  until  all  phonemes  in  the  word  have  been 
looked  for. 

During  the  mapping,  as  each  phoneme  is  located,  the  estimated  boundaries  for 
phonemes  not  yet  located  must  be  adjusted  to  reflect  any  new  information. 

This  i3  done  as  follows.  Assume  we  have  a  partially  filled  mapping  table  with 
some  phonemes  already  found  with  definite  boundaries,  some  looked  for  but  not 
found  and  some  not  yet  looked  for.  We  are  now  given  the  result  of  a  call  to  a 


10  May  1974 


-13- 


System  Developing- 1  Corporation 
TM- 53 15/000/00 


phoneme  locator,  i.e.  a  score,  a  beginning  segment  number,  and  an  ending  seg¬ 
ment  number.  If  the  new  answer  has  a  score  of  ze^o,  meaning  that  the  phoneme 
was  not  found,  the  boundaries  estimated  previously  are  not  changed.  If  the 
new  answer  is  positive,  a  check  is  made  to  ensure  that  the  new  boundaries  are 
consistent  with  previous  results.  If  the  new  phoneme  occurs  before  the  last 
previously  found  phoneme  or  after  the  next  previously  found  phoneme,  the  new 
answer  is  rejected;  otherwise,  the  new  beginning  and  ending  segment  data  are 
entered  into  the  mapping  table.  Note  that  thin  allows  considerable  overlap 
to  occur  between  phonemes  but  that  they  are  not  allowed  to  be  out  of  order  in 
time. 

At  this  point,  boundaries  of  the  unloi ated  phonemes  must  be  adjusted.  First, 
the  boundaries  of  phonemes  that  lie  between  two  known  boundaries  aie  adjusted 
to  agree  with  the  fixed  data.  If  two  or  more  phonemes  lie  between  the  same 
fixed  boundaries,  each  is  given  an  equal  portion  of  the  availaLle  slot. 
Finally,  the  unlccated  phonemes  occurring  after  the  last  fixed  time  boundary 
and  before  the  first  fixed  time  boundary  are  shifted  forward  or  backward  to 
agree  with  that  boundary,  with  8  segments  (80  msec.)  allotted  to  each. 

At  the  end  of  the  mapping  process,  after  all  phonemes  have  been  looked  for,  a 
second  pass  is  performed.  That  is,  if  any  phonemes  were  not  found  (score  of 
zero) ,  they  are  looked  for  again,  on  the  supposition  that  estimates  of  their 
boundaries  may  have  improved  during  the  mapping  of  other  phonemes.  This 
second  pass  is  repeated  until  no  further  progress  is  made. 

At  this  point,  the  mapping  table  is  complete  and  read/  to  be  scored. 


3.6  SCORING  THE  COMPLETED  MAPPING  TABLE 

An  overall  word  score  is  dei'\ed  from  the  completed  mapping  table.  Originally, 
this  was  just  the  average  phoneme  score,  but  further  development  has  compli¬ 
cated  the  scoring  algorithm.  This  is  the  scoring  algorithm  cur  tly  in  use; 
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SI  SUM,  DVSR  -  0.  T.  -  1. 

S£  If  the  word  begins  more  than  8  segments  past  the  assutr.d  beginning, 
LViR  ic  incremented  by  1  for  each  such  8  segment.,. 

S3  Any  more  phonemes  left  to  score?  If  not,  go  to  step  S9 . 

34  Add  SCORE (PHONEME (I))  to  SUM.  Add  1  to  DVSR. 

55  IT  Inv-NEME(I)  is  a  stressed  vowel,  repeat  step  S4. 

56  If  SCORE(PHONEME(I))=0,  add  1  to  DVSR  for  each  4  segments  in  excess 
of  8  segments  of  length. 

57  If  0< S CORE (PHONEME ( I )  )  <  5 ,  add  1  to  DVSR  for  each  4  segments  in 
excess  of  12  segments  of  length. 

58  1=1+1.  Go  to  step  S3. 

59  If  LENGTH ( WORD)  is  not  at  least  3  segments  times  the  number  of 
phonemes  in  the  word,  add  1  to  DVSR  for  each  segment  under  this 
minimum. 

510  SUM=2*SUM.  DVSR=MAX(DVSR, 1) . 

511  SCORE=MIN(SUM/DVSR ,99) . 

512  Multiply  the  score  by  the  fraction  of  phonemes  in  the  word 
having  a  positive  score. 


4.  CONCLUSIONS 

The  approach  to  phonemi c-aco" s t ic  mapping  described  in  this  document  has  five 
important  advantages:  (1)  phonemes  ere  searched  for  in  an  order  that  maximizes 
the  chance  that  they  will  be  properly  found;  '2)  the  process  is  able  to  deal 
with  phoneme  duration  variations  without  resorting  to  arbitrary  thresholds  for 
minimum  and  maximum  phonem=  length;  (3)  by  performing  segmentation  in  conjunc¬ 
tion  with  recognition,  a  more  accurate  segmentation  is  possible  than  by  segmen¬ 
tation  in  isolation;  (4)  the  procedure  is  capable  of  correctly  identifying 
words  even  when  it  cannot  identify  every  single  phoneme  in  the  word — i.e. ,  it 
is  fail-soft;  and,  (5)  it  is  extensible:  to  add  a  new  word  to  the  vocabu’ ^ry, 
we  add  only  its  phonemic  spelling,  not  any  code. 

In  a  trial  run  of  2C  utterances,  the  program  was  able  to  identify  the  correct 
word  in  first  place  89%  of  the  time.  The  majority  of  errors  resulted  from 
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co-articulation  effects  between  words,  which  are  beyond  the  scope  of  the  top- 
end  of  the  Mini-System.  The  vocabulary  for  this  test  was  160  words,  and  the 
grammar  contained  3C  rewrite  rul’s. 
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AFPENDIX  1:  EXAMPLE  MAPPING  OF  THE  WORD  "EXPLAIN" 


a)  Initial 

mapping  table 

PHX 

STX 

CLX 

sex 

FNX 

BX 

EX 

(Goodness) 

AX 

0 

VW 

0 

0 

15 

22 

4  irk 

K 

0 

PL 

0 

0 

23 

30 

9 

s 

0 

SS 

0 

0 

31 

38 

4  ** 

P 

0 

PL 

0 

0 

39 

46 

15 

L 

0 

SO 

0 

0 

47 

54 

18 

EY 

2 

VW 

0 

0 

55 

62 

7 

N 

0 

NA 

0 

0 

63 

70 

3 

Score 

!  Of 

"Ax::  i 

3  29 

Score 

!  Of 

"S"  is 

60  — 

therefore 

"S"  chosen  a 

b)  Mapping 

table 

after  the 

anchor 

"S" 

is  entered. 

PHX 

STX 

CLX 

sex 

FNX 

BX 

EX 

(Goodness) 

AX 

0 

VW 

0 

0 

JO 

23 

4  ** 

K 

0 

PL 

0 

0 

24 

31 

6 

S 

0 

SS 

60 

1 

32 

44 

X 

P 

0 

PL 

0 

0 

45 

52 

6 

L 

0 

SO 

0 

0 

53 

60 

9 

EY 

2 

VI’ 

0 

c 

61 

68 

4 

N 

0 

NA 

0 

0 

69 

76 

5 

c)  The  "AX"  is  nov  searched 

for  and  entered. 

PHX 

STX 

CLX 

sex 

FNX 

BX 

EX 

(Goodness) 

AX 

0 

VW 

29 

1 

17 

22 

X 

K 

0 

PL 

0 

0 

23 

31 

6 

S 

0 

SS 

60 

1 

32 

44 

X 

P 

0 

PL 

0 

0 

45 

52 

6 

L 

0 

SO 

0 

0 

53 

6C 

9 

EY 

2 

VW 

0 

0 

61 

68 

4  ** 

N 

0 

NA 

0 

0 

69 

76 

5 

i 


anchor. 
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d)  The  program 

now  leaps  out 

for 

tie  "EY"  and 

finds  it. 

PHX 

SIX 

CLX 

sex 

FNX 

BX 

EX 

(Goodness) 

/X 

0 

VW 

29 

1 

17 

22 

X 

K 

0 

PL 

0 

0 

23 

31 

6 

S 

0 

SS 

60 

1 

32 

44 

X 

P 

0 

PL 

0 

o 

45 

54 

6 

L 

0 

SO 

0 

0 

55 

63 

6 

EY 

2 

VW 

35 

1 

64 

81 

X 

N 

0 

HA 

0 

0 

82 

89 

2  ** 

e)  The  prograr. 

now  looks  for 

and 

finds 

"N". 

PHX 

STX 

CLX 

Sw 

FNX 

BX 

EX 

(Goodness) 

AX 

0 

VW 

29 

1 

17 

22 

X 

K 

0 

PL 

0 

0 

2  ■» 

31 

6  ** 

S 

0 

SS 

60 

1 

32 

44 

X 

P 

0 

PL 

0 

0 

45 

54 

6 

J. 

0 

SO 

0 

0 

55 

6 

EY 

2 

VW 

35 

1 

64 

81 

X 

N 

0 

NA 

34 

1 

82 

93 

X 

t)  The  program 

now  returns  to  fill  in 

the  "K", 

PHX 

STX 

CLX 

sex 

FNX 

HX 

EX 

(Goodness) 

AX 

0 

VW 

29 

1 

17 

22 

X 

K 

0 

PL 

30 

1 

24 

31 

X 

S 

0 

SS 

60 

1 

32 

44 

X 

P 

J 

PL 

0 

0 

45 

54 

6  ** 

L 

0 

SO 

0 

0 

55 

63 

6 

EY 

2 

VW 

35 

1 

64 

81 

X 

N 

0 

NA 

34 

1 

82 

93 

X 
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g)  The  mapper  now  proceeds  to  the  plosive  "P". 


PHX 

LTX 

CLX 

sex 

PNX 

BX 

EX 

(Goodness) 

Ay 

0 

VW 

29 

l 

17 

22 

X 

't 

ft 

0 

PL 

30 

1 

24 

31 

X 

s 

0 

SS 

60 

1 

32 

44 

X 

p 

0 

PL 

19 

s 

J 

45 

53 

X 

L 

0 

SO 

0 

0 

54 

» 

6  ** 

EY 

2 

VW 

35 

1 

A 

04 

81 

X 

N 

0 

NA 

34 

1 

82 

93 

X 

h)  The  napper  now  looks  for  the  "L"  and  fails  to  find  it.  Thus  the  final 
mapping  table  is: 


PHX 

STX 

CLX 

sex 

INX 

BX 

EX 

AX 

0 

VW 

29 

1 

17 

22 

K 

0 

PL 

30 

1 

24 

31 

S 

0 

SS 

60 

1 

32 

44 

P 

0 

PL 

19 

1 

45 

53 

L 

0 

SO 

0 

1 

54 

63 

EY 

2 

VW 

35 

1 

6* 

81 

N 

0 

NA 

34 

1 

82 

93 

This  evaluates  to  a  final  word  score  of  51  9. 
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APPENDIX  2:  THE  ARP  ABET  COMPUTER  PHONEMIC  ALPHABET 


9/1/73 


( 


- r 

Computer  Renrcrertation  1 

Example 

Computer  r-.epresei  tation 

- : - j 

Phoneme 

-Character 

2 -Characters 

-Character 

2-Characters 

i 

i 

IY 

beat 

P 

P 

P 

j>et 

I 

I 

IH 

bit 

t 

t 

T 

ten 

e 

e 

EY 

bait 

k 

k 

K 

kit 

t 

E 

EH 

bet 

b 

b 

B 

bet 

A 

9 

A£ 

bat 

d 

d 

D 

debt 

« 

a 

AA 

Bob 

9 

9 

Cy 

g_et 

A 

A 

AH 

hut 

h 

h 

HH 

hat 

D 

o 

AO 

bought 

£ 

f 

F 

fat 

o 

o 

CM 

boat 

e 

T 

TH 

thing 

0 

u 

OH 

book 

S 

8 

- 

sat 

u 

u 

W 

boot 

s  or  / 

s 

SH 

shut 

a 

X 

AX 

about 

V 

V 

V 

vat 

i 

X 

IX 

roses 

* 

D 

DH 

that 

y 

R 

ER 

bird 

X 

7 

206 

i  O  or  ew 

W 

AM 

down 

z  or) 

Z 

ZH 

a£ure 

f I  or  ay 

Y 

AY 

Pil 

V 

c 

C 

CH 

church 

ol  or  ay 

0 

OY 

bo* 

J 

1 

Jl! 

judge 

y 

y 

Y 

you 

M 

H 

WH 

which 

W 

W 

w 

wit 

syl  1,1 

L 

EL 

battle 

r 

r 

R 

rent 

syl  m,* 

M 

EM 

bottom 

i 

i 

L 

let 

syl  n,n 

N 

EN 

button 

ra 

m 

M 

met 

flapped  t,I 

F 

DX 

batter 

n 

n 

N 

net 

glottal  Stoft 

c 

Q 

n 

G 

NX 

»lng 

silence 

- 

- 

non-speech 

segment 

1 

1 

laugh,  etc. 

AUXILIARY  SYMBOLS  (1-  AND  2-CHARACTER  CODES  ARE  IDENTICAL)  S 

Symbol 

Meaning 

Symbol 

Meaning 

♦ 

Morpheme  bovndary 

:3  or  . 

Fall-rise  or  non-term  juncture 

/ 

Word  boundary 

4  t# 

Comment  (anything  except*  or**) 

• 

Utterance  boundary 

1  1 

ApOS< 

-surround  special  symbol  j 

1  in  comment 

I 

Tone  group  boundary 

:  1  or  . 

Falling  or  dec 1 .  juncture 

<  ) 

Phoneme  class  information 

•■2  or  ? 

Rising  or  inter,  juncture 

<  > 

Phonetic  or  allohonic  escape  j 

j  STRESS  RFP RESENTAT J ONS  (IF 

PRESENT 

MUST  IMMEDIATELY  FOLLOW  THE  VOWEL) 

J 

Stress 

Assignment 

1 

o 

■  v  >r 

■ 

3 

I  Tertiary  stress 

X 

* 

(Etc 

.) 

2 

t 

NOTE t  Space*  are  ignored  except  within  eacapes . 
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